I2R-NUS-MSRA at TAC 2011: Entity Linking

نویسندگان

  • Wei Zhang
  • Chew Lim Tan
  • Jian Su
  • Bin Chen
  • Wenting Wang
  • Zhiqiang Toh
  • Yanchuan Sim
  • Yunbo Cao
  • Chin-Yew Lin
چکیده

In this paper, we report the joint participation of I2R-NUS team and MSRA team in entity linking task for Knowledge Base Population at Text Analysis Conference 2011. I2R-NUS team submitted two results with the full system and the partial system for diagnosis purpose. Both results incorporate the new technologies: acronym expansion, instance selection and topic modeling proposed in our recent papers. In clustering step, three clustering algorithms: spectral graph partitioning (SGP), hierarchical agglomerative clustering (HAC) and latent Dirichlet allocation (LDA) are combined for the full system. The full system achieves a competitive F-score 0.8311. The partial system uses only Wikipedia Source to generate candidates for KB linking and only LDA for clustering , which leads to 0.813 Fscore. Although due to the time constrain, the combined result of I2R-NUS full system with MSRA KB linking result was not submitted, it shows 0.828 F-score afterwards.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

NUS-I2R: Learning a Combined System for Entity Linking

In this paper, we report the joint participation of NUS and I2R team in Knowledge Base Population at Text analysis conference 2010. For Entity Linking, we analyze IR approaches and SVM classification in the disambiguation stage and develop a supervised learner for combining these approaches. The combined system performs better than the individual components and achieves results much better than...

متن کامل

MSRA at TAC 2011: Entity Linking

The Knowledge Base Population task aims at advancing the state of the art for systems that automatically discover information about named entities and then incorporate this information in a knowledge source. The overall task of populating a knowledge base is decomposed into two related tasks: Entity Linking, where names must be aligned to entities in the KB, and Slot Filling, which involves min...

متن کامل

ECNU: Brief System Description of Submission to Knowledge Base Population at TAC 2011

This paper briefly reports our submissions to the three tasks in TAC KBP 2011, i.e., Slot Filling (SF for short), Entity Linking (EL for short) and Cross-lingual Entity Linking (CEL for short).

متن کامل

THUNLP at TAC KBP 2011 in Entity Linking

Entity Linking is to link a name string from plain-text documents to the corresponding entry in given knowledge base. In this paper we demonstrate our entity linking system for TAC KBP 2011 Track. Our system implements pairwise and listwise learning to rank methods to create a ranking list of candidates with several kinds of features, including context similarity, term frequency, key entity ext...

متن کامل

HITS' Cross-lingual Entity Linking System at TAC 2011: One Model for All Languages

This paper presents HITS’ system for crosslingual entity linking at TAC 2011. We approach the task in three stages: (1) context disambiguation to obtain a language-independent representation, (2) entity disambiguation, (3) clustering of the queries that have not been linked in the second step. For each of these steps one single model is trained and applied to both languages, i.e. English and Ch...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011